STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling Supplementary Material
نویسندگان
چکیده
In the supplementary material, we present the analysis of semantic boundary accurary in Section 1. In section 2, we evaluate the oracle performance on NYUDv2 40-class task with our spatio-temporal data-driven pooling. In section 3, we analyze the groundtruth annotations of the NYUDv2 40class task. In section 4, we provide the qualitative results of the semantic segmentation results of the NYUDv2 4-class and 13-class tasks. In section 5, we provide more qualitative examples of the semantic segmentation results of the NYUDv2 40-class task. In section 6, we show some failure cases which do not achieve better performance than FCN.
منابع مشابه
RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling
Beyond the success in classification, neural networks have recently shown strong results on pixel-wise prediction tasks like image semantic segmentation on RGBD data. However, the commonly used deconvolutional layers for upsampling intermediate representations to the full-resolution output still show different failure modes, like imprecise segmentation boundaries and label mistakes in particula...
متن کاملSTFCN: Spatio-Temporal FCN for Semantic Video Segmentation
This paper presents a novel method to involve both spatial and temporal features for semantic segmentation of street scenes. Current work on convolutional neural networks (CNNs) has shown that CNNs provide advanced spatial features supporting a very good performance of solutions for the semantic segmentation task. We investigate how involving temporal features also has a good effect on segmenti...
متن کاملTemporal Semantic Motion Segmentation Using Spatio Temporal Optimization
Segmenting moving objects in a video sequence has been a challenging problem and critical to outdoor robotic navigation. While recent literature has laid focus on regularizing object labels over a sequence of frames, exploiting the spatio-temporal features for motion segmentation has been scarce. Particularly in real world dynamic scenes, existing approaches fail to exploit temporal consistency...
متن کامل3D Semantic Parsing of Large-Scale Indoor Spaces Supplementary Material
The supplementary material contains: (I) most importantly, a video that shows detailed and comprehensive results, as well as a PDF which includes (II) more experimental results in terms of PR curves, sample RGB-D detection results, and comparison to the baselines, (III) a few example real-world applications of large-scale semantic parsing, (IV) implementation details of CRF to enforce contextua...
متن کاملA Method for Modeling and Segmentation of Spatio-Temporal Shapes
This paper presents a method for modeling and segmenting spatio-temporal shapes. The modeling part is based on obtaining a description of the statistical variations of spatio-temporal shape parameters by studying a representative training set of examples. A deformable model of spatio-temporal shapes is used for segmenting similar shapes in new image sequences. The deformations of the model are ...
متن کامل